{
 "cells": [
  {
   "cell_type": "code",
   "execution_count": 2,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "*** Introductory Examples for the NLTK Book ***\n",
      "Loading text1, ..., text9 and sent1, ..., sent9\n",
      "Type the name of the text or sentence to view it.\n",
      "Type: 'texts()' or 'sents()' to list the materials.\n",
      "text1: Moby Dick by Herman Melville 1851\n",
      "text2: Sense and Sensibility by Jane Austen 1811\n",
      "text3: The Book of Genesis\n",
      "text4: Inaugural Address Corpus\n",
      "text5: Chat Corpus\n",
      "text6: Monty Python and the Holy Grail\n",
      "text7: Wall Street Journal\n",
      "text8: Personals Corpus\n",
      "text9: The Man Who Was Thursday by G . K . Chesterton 1908\n",
      "Displaying 11 of 11 matches:\n",
      "ong the former , one was of a most monstrous size . ... This came towards us , \n",
      "ON OF THE PSALMS . \" Touching that monstrous bulk of the whale or ork we have r\n",
      "ll over with a heathenish array of monstrous clubs and spears . Some were thick\n",
      "d as you gazed , and wondered what monstrous cannibal and savage could ever hav\n",
      "that has survived the flood ; most monstrous and most mountainous ! That Himmal\n",
      "they might scout at Moby Dick as a monstrous fable , or still worse and more de\n",
      "th of Radney .'\" CHAPTER 55 Of the Monstrous Pictures of Whales . I shall ere l\n",
      "ing Scenes . In connexion with the monstrous pictures of whales , I am strongly\n",
      "ere to enter upon those still more monstrous stories of them which are to be fo\n",
      "ght have been rummaged out of this monstrous cabinet there is no telling . But \n",
      "of Whale - Bones ; for Whales of a monstrous size are oftentimes cast up dead u\n"
     ]
    }
   ],
   "source": [
    "import nltk\n",
    "from nltk.book import  *\n",
    "text1.concordance(\"monstrous\")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 7,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Displaying 25 of 25 matches:\n",
      "teach them by what name a whale - fish is to be called in our tongue leaving out\n",
      "n of the word , you deliver that which is not true .\" -- HACKLUYT \" WHALE . ... \n",
      " ... Sw . and Dan . HVAL . This animal is named from roundness or rolling ; for \n",
      "ndness or rolling ; for in Dan . HVALT is arched or vaulted .\" -- WEBSTER ' S DI\n",
      "EBSTER ' S DICTIONARY \" WHALE . ... It is more immediately from the Dut . and Ge\n",
      "- JONAH . \" There go the ships ; there is that Leviathan whom thou hast made to \n",
      "nt ; and he shall slay the dragon that is in the sea .\" -- ISAIAH \" And what thi\n",
      "fly , let us fly ! Old Nick take me if is not Leviathan described by the noble p\n",
      "H .\" \" The sovereignest thing on earth is parmacetti for an inward bruise .\" -- \n",
      "FACE TO GONDIBERT . \" What spermacetti is , men might justly doubt , since the l\n",
      "ATTLE OF THE SUMMER ISLANDS . \" By art is created that great Leviathan , called \n",
      "r State --( in Latin , Civitas ) which is but an artificial man .\" -- OPENING SE\n",
      " S ANNUS MIRABILIS . \" While the whale is floating at the stern of the ship , th\n",
      "hale can ' t open his mouth , but that is a fable . ... They frequently climb up\n",
      "sort that was killed by any man , such is his fierceness and swiftness .\" -- RIC\n",
      "729 . \"... and the breath of the whale is frequendy attended with such an insupp\n",
      "emptible in the comparison . The whale is doubtless the largest animal in creati\n",
      "acetti Whale found by the Nantuckois , is an active , fierce animal , and requir\n",
      ". \" And pray , sir , what in the world is equal to it ?\" -- EDMUND BURKE ' S REF\n",
      "ng the seas from pirates and robbers , is the right to royal fish , which are wh\n",
      "LL SIZED ONE .) \" The aorta of a whale is larger in the bore than the main pipe \n",
      "aring in its passage through that pipe is inferior in impetus and velocity to th\n",
      ".\" -- PALEY ' S THEOLOGY . \" The whale is a mammiferous animal without hind feet\n",
      "r whale than this In the vast Atlantic is ; Not a fatter fish than he , Flounder\n",
      "rved : there -- pointing to the sea -- is a green pasture where our children ' s\n"
     ]
    }
   ],
   "source": [
    "text1.concordance(\"is\")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 9,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Displaying 25 of 25 matches:\n",
      ", however , and , as a mark of his affection for the three girls , he left them\n",
      "t . It was very well known that no affection was ever supposed to exist between\n",
      "deration of politeness or maternal affection on the side of the former , the tw\n",
      "d the suspicion -- the hope of his affection for me may warrant , without impru\n",
      "hich forbade the indulgence of his affection . She knew that his mother neither\n",
      "rd she gave one with still greater affection . Though her late conversation wit\n",
      " can never hope to feel or inspire affection again , and if her home be uncomfo\n",
      "m of the sense , elegance , mutual affection , and domestic comfort of the fami\n",
      ", and which recommended him to her affection beyond every thing else . His soci\n",
      "ween the parties might forward the affection of Mr . Willoughby , an equally st\n",
      " the most pointed assurance of her affection . Elinor could not be surprised at\n",
      "he natural consequence of a strong affection in a young and ardent mind . This \n",
      " opinion . But by an appeal to her affection for her mother , by representing t\n",
      " every alteration of a place which affection had established as perfect with hi\n",
      "e will always have one claim of my affection , which no other can possibly shar\n",
      "f the evening declared at once his affection and happiness . \" Shall we see you\n",
      "ause he took leave of us with less affection than his usual behaviour has shewn\n",
      "ness .\" \" I want no proof of their affection ,\" said Elinor ; \" but of their en\n",
      "onths , without telling her of his affection ;-- that they should part without \n",
      "ould be the natural result of your affection for her . She used to be all unres\n",
      "distinguished Elinor by no mark of affection . Marianne saw and listened with i\n",
      "th no inclination for expense , no affection for strangers , no profession , an\n",
      "till distinguished her by the same affection which once she had felt no doubt o\n",
      "al of her confidence in Edward ' s affection , to the remembrance of every mark\n",
      " was made ? Had he never owned his affection to yourself ?\" \" Oh , no ; but if \n"
     ]
    }
   ],
   "source": [
    "text2.concordance(\"affection\")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 11,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "true contemptible christian abundant few part mean careful puzzled\n",
      "mystifying passing curious loving wise doleful gamesome singular\n",
      "delightfully perilous fearless\n"
     ]
    }
   ],
   "source": [
    "text1.similar(\"monstrous\")"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# show length\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "260819"
      ]
     },
     "execution_count": 4,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "len(text1)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 6,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "very so exceedingly heartily a as good great extremely remarkably\n",
      "sweet vast amazingly\n"
     ]
    }
   ],
   "source": [
    "text2.similar(\"monstrous\")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 7,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "a_pretty am_glad a_lucky is_pretty be_glad\n"
     ]
    }
   ],
   "source": [
    "text2.common_contexts([\"monstrous\",\"very\"])"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 9,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      ",_it ,_the and_the i_a it_the ._it you_a ,_seen but_all ,_that and_in\n",
      ",_sure for_this whale_a ._this\n"
     ]
    }
   ],
   "source": [
    "text2.common_contexts([\"as\",\"be\"])"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 12,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "here me you u too about down caps black town listening listen kent\n",
      "litening\n",
      "im_to\n"
     ]
    }
   ],
   "source": [
    "#similar, common_context\n",
    "text5.similar(\"happy\")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 14,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "am_you\n"
     ]
    }
   ],
   "source": [
    "text5.common_contexts([\"happy\",\"too\"])"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 16,
   "metadata": {
    "scrolled": true
   },
   "outputs": [
    {
     "data": {
      "image/png": "iVBORw0KGgoAAAANSUhEUgAAAZ4AAAEWCAYAAABWn/G6AAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDIuMi4yLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvhp/UCwAAIABJREFUeJzt3Xu8XFV99/HPNwSMCCTcKiAkB6iKiBjJAYWi52DxAkWKT1GgWomFItYbWmrxiZqDD7YgVMFLBeqjkRZQRGhT1IJiI4qCJNwCCOUWNHIXgwSQ669/7LU5Ozszc2bmzKxzTvN9v17zmj1rrb3Wb6/ZM7/sS+YoIjAzM8tl2kQHYGZm6xYnHjMzy8qJx8zMsnLiMTOzrJx4zMwsKyceMzPLyonH1lmSvifp8HH2MV/ST8bZx42ShsfTRy/1Yl66GHNE0r/mHNMmjhOPTQmSVkjat5d9RsR+EfH1XvZZJWlAUkhanR73SbpI0htqcbw8Ipb0K45O9WteJC2S9GSai4ckfV/STl300/N9wfJy4jHrv1kRsRHwSuD7wIWS5k9UMJKmT9TYwGfSXGwL3A8smsBYbII48diUJ+kASddKWiXpp5J2TeU7pn9Z75ZebyPpwfK0lqQlko6s9PNXkn4h6RFJN1XWO07S7ZXyt3YTZ0TcGxGnASPASZKmpf6f+xe8pD0kLZX0u3SE9NlUXh49HSXpbkn3SPqbSuzTKnH+RtJ5kjarrXuEpF8CP5Q0Q9K/prarJF0l6YX1eUn9flzSXZLul3SWpJm1fg+X9Ms0twvanIvHgHOAXRrVSzownYJcleJ5WSr/F2A28B/pyOmjnb4PNvGceGxKS8nhq8B7gM2BM4DFkp4XEbcDfwecLWlD4GvAokantSS9jSIhvAvYBDgQ+E2qvh14LTATOB74V0lbjyPsC4A/AF7aoO404LSI2ATYETivVr8P8GLgjcBxlVNOHwQOAoaAbYDfAl+qrTsEvAx4E3B42p7tKObtaODxBvHMT499gB2AjYAv1trsnbblj4FPlkmiFUkbAe8ArmlQ9xLgXOAYYEvguxSJZoOI+Avgl8BbImKjiPjMWGPZ5OPEY1PdXwFnRMSVEfFMujbxBPAagIj4Z+BW4Epga6DZv8iPpDgNdFUUbouIu1If34qIuyPi2Yj4Zupvj3HEfHd63qxB3VPAH0raIiJWR8QVtfrjI+LRiFhOkUgPS+XvARZExMqIeIIiiR5cO602ktZ9PI2zOfCHad6WRcTvGsTzDuCzEXFHRKwGPgYcWuv3+Ih4PCKuA66jOKXYzLGSVgG3USSx+Q3aHAJ8JyK+HxFPAacAzwf2atGvTSFOPDbVzQH+Jp2SWZW+1Laj+Fd/6Z8pTul8IX0pN7IdxZHNWiS9q3Iqb1Xqa4txxPyi9PxQg7ojgJcAN6fTXwfU6n9VWb6L0e2cQ3HtqIzxF8AzwAubrPsvwMXAN9Kpu89IWr9BPNukcapjTq/1e29l+TGKhNLMKRExKyK2iogD01FpyzEj4tkU+4satLUpyInHprpfAZ9OX2blY8OIOBeeO6VzKvD/gZHyukeTfnasF0qaQ5G43g9sHhGzgBsAjSPmt1JcWL+lXhERt0bEYRSn4k4Czpf0gkqT7SrLsxk9evoVsF9tHmZExK+r3VfGeSoijo+InSmOJA6gOM1YdzdFUquO+TRwX5vb2o01xpQkiu0ut8U/qT/FOfHYVLJ+uihePqZTJIWjJb1ahRdI+hNJG6d1TgOWRcSRwHeA05v0/RWK00DzUj9/mJLOCyi+6B4AkPRumlwQH4ukF0p6P7AQ+Fj6l3y9zTslbZnqVqXiZypNPiFpQ0kvB94NfDOVnw58OsWMpC0l/WmLWPaR9ApJ6wG/ozj19kyDpucCH5a0fUrifw98MyKe7mTbO3Qe8CeS/jgdhf0NxenTn6b6+yiuN9kU5cRjU8l3KS6Al4+R
iFhKcZ3nixQX1G8jXTdIX7xvprhwDvARYDdJ76h3HBHfAj5NcafVI8C/AZtFxE3APwI/o/jCewVweYdxr5L0KLAc2B94W0R8tUnbNwM3SlpNkTQPjYjfV+p/lLbxUorTVpek8tOAxcAlkh4BrgBe3SKmrYDzKZLOL1K/jf4D51cpTstdBtwJ/B74QOvNHZ+IuAV4J/AF4EHgLRQ3EzyZmvwD8PF0WvHYfsZi/SH/ITizyU/SAMUX//p9Ptow6zsf8ZiZWVZOPGZmlpVPtZmZWVY+4jEzs6wm8scCJ60tttgiBgYGJjoMM7MpZdmyZQ9GxJZjtXPiaWBgYIClS5dOdBhmZlOKpLvGbuVTbWZmlpkTj5mZZeXEY2ZmWTnxmJlZVk48ZmaWlROPmZll5cRjZmZZOfGYmVlWTjxmZpaVE4+ZmWXlxGNmZlk58ZiZWVZOPGZmlpUTj5mZZeXEY2ZmWTnxmJlZVk48ZmaWlROPmZll5cRjZmZZOfGYmVlWTjxmZpaVE4+ZmWXlxGNmZlk58ZiZWVZOPGZmlpUTj5mZZeXEY2ZmWTnxmJlZVk48ZmaWlROPmZll5cRjZmZZTVjikTha4l1peb7ENpW6r0jsPFGxmZlZ/0xY4ong9AjOSi/nw2jiieDICG6akMB6YGAARkaK5ZGRNR/Dw2svDwyMLg8Pjy6XyjZlfbXfsn113EaPgYHR9arr1FXL6/1Wx623rZaXr5sZHoYZM4p+Z8wYbTtrVlFXxlrtp1pen4dGsdRjqK5T36Zy/Xa3oVG/1XWr816PrRy7bFfd1mqf9Vjq5c3aNNoH6nGPtY31mFq1b7aNzdZvFkejdaZPHzv+scYqyxqV12Me6/0eGSn20XbGa6bRfloul3XVz387810vKz/f1X2h+jmrftdMFEVEnoGKo5tjgQCuB24HVgMrgEXAr4HHgT2B76W22wCfSl08H9gggu0l5gGfBTYCHgTmR3CPxBLgSmAfYBZwRAQ/lng58DVgA4pk+2cR3Nos1sHBwVi6dOl4thWAiNHlbpRvTb2PRv22M1a9TaO3Xlpz3Oo6jZbrMdbXbaRRnM1iazeGZnG3iq/R63a2oVG/rbaj3k+jbar3X4+l0VjtthlrXhqN3yjeVnVjvd/drNNO/GONVe2rk+1s1K5RP+3MX7O6Zvtns89YPZZm89Lqe6Cd74DxkLQsIgbHapfliCd98S8AXh/BK4EPlXURnA8sBd4RwdwIHq/ULU5lc4HrgFMk1ge+ABwcwTzgq8CnK8NNj2AP4BhgYSo7Gjgt9TMIrOzXtpqZWWvTM43zeuD8CB4EiOChTo4EJD4KPB7BlyR2AXYBvp/6WA+4p9L8gvS8DBhIyz8DFkhsC1zQ6GhH0lHAUQCzZ89uPzgzM+tIrms8ojjF1vmK4o+Bt1EctZR93VgeCUXwigjeWFnlifT8DCmxRnAOcCDFqbyLJV5fHycizoyIwYgY3HLLLbsJ1czM2pAr8VwKvF1icwCJzWr1jwAb11eSmAP8E/D2yim4W4AtJfZMbdZPp/KaktgBuCOCzwOLgV3HszFmZta9LKfaIrhR4tPAjySeAa6huKmgtAg4XXru5oLSfGBz4MJ0Wu3uCPaXOBj4vMRMim04FbixRQiHAO+UeAq4l9EbFvpizhyYP79YXrhwzbolS0bvJimXFy0avbNtyZKirnrHycKFRZv580fry37L1/Vx6xYtGl2vuk7d0NDay822p9q2vp311/UxrrgCttoK7r0XjjuuKJ85E+bOhRUr1u5naGi0vLoN1fr6OtUY6ttdn6uhobXv8mm2DY36rb9fVdXYyrHLduX70um4zbTaB5rNS6M+mo3b7H2ub2Oz9ZvF0WidE06Aj3+88/XaKYO1Yx5r3hcuhFNP7X68+pjN9s/qvtjOfNfLyucVK0b3hRNPHP2clf23+h7ot2x3tU0l472rzcxsXTSp7mozMzMrOfGYmVlWTjxmZpaVE4+ZmWXlxGNmZlk58ZiZ
WVZOPGZmlpUTj5mZZeXEY2ZmWTnxmJlZVk48ZmaWlROPmZll5cRjZmZZOfGYmVlWTjxmZpaVE4+ZmWXlxGNmZlk58ZiZWVZOPGZmlpUTj5mZZeXEY2ZmWTnxmJlZVk48ZmaWlROPmZll5cRjZmZZOfGYmVlWTjxmZpaVE4+ZmWXVceKRGJE4th/B2MQYGeltP930NzLSuzhs4g0M9P79bNbfyAjMmDG6D42MwLRpo3XDw6PL1T7K8l7H0yvDw40/F+XcNtq2avvqes2WJ4oiorMVxAiwOoJT+hLR2ONPj+Dpfo4xODgYS5cu7ecQk4oEHe4GLfvppj+peO5FHDbx+vF+NtuvyrHqyn2x0XKr/sYbT69Ut6s6Tn17q9tWL2u0rf2MW9KyiBgcq11bRzwSCyRukfgB8NJUtqPEf0osk/ixxE6pfJHElyX+S+IOiSGJr0r8QmJRpc/DJJZL3CBxUqX8zRJXS1wncWkqG5E4U+IS4CyJgTTm1emxV2X9j6Z+r5M4McV5daX+xRLL2tluMzPrveljNZCYBxwKvCq1vxpYBpwJHB3BrRKvBv4JeH1abdO0fCDwH8AfAUcCV0nMBe4HTgLmAb8FLpE4CLgc+GfgdRHcKbFZJZR5wN4RPC6xIfCGCH4v8WLgXGBQYj/gIODVETwmsVkED0k8LDE3gmuBd8NoAhzdTh0FHAUwe/bstibPzMw6N2biAV4LXBjBYwASi4EZwF7AtyqHeM+rrPMfEYTEcuC+CJandW8EBoA5wJIIHkjlZwOvA54BLovgToAIHqr0uTiCx9Py+sAXUxJ7BnhJKt8X+FoZa2X9rwDvlvgIcAiwR30jI+JMimTK4OCgT/iYmfVJO4kHoP5FPA1YFcHcJu2fSM/PVpbL19Oh6TUaNRir9Ghl+cPAfcArUyy/H2P9bwMLgR8CyyL4TZMxzMysz9pJPJcBiyROTO3fApwB3Cnxtgi+JSFg1wiua3PcK4HTJLagONV2GPAF4GfAlyS2L0+11Y56SjOBlRE8K3E4sF4qvwT4pMQ51VNt6ZTcxcCXgSPajHGdsXBhb/vppr9exWCTw5w5MH9+b/tsto8sXAgnngjHHTda9qlPjdYtWdJ4/aGh/sTTK0NDje+8q85ts22rlzVbniht3dUmsQB4F3AXsBK4ieIo4svA1hSnvr4RwafSDQQXRXC+xEBa3iX1U637c+BjFEcp343go6nNfsDfUxzJ3B/BG+p30qXrOt8GHgP+C/hABBuluuNSrE+mfv9vKn9NWmd2BM+02t517a42M7NeaPeuto5vp56q0v89mhnBJ8Zq68RjZta5dhNPu9d4pjSJC4EdGb3rzszMJsg6kXgieOtEx2BmZgX/VpuZmWXlxGNmZlk58ZiZWVZOPGZmlpUTj5mZZeXEY2ZmWTnxmJlZVk48ZmaWlROPmZll5cRjZmZZOfGYmVlWTjxmZpaVE4+ZmWXlxGNmZlk58ZiZWVZOPGZmlpUTj5mZZeXEY2ZmWTnxmJlZVk48ZmaWlROPmZll5cRjZmZZOfGYmVlWTjxmZpaVE4+ZmWXVt8Qj8UGJX0ic3eN+RySO7WWfE2lgAEZGRl+PjKz9uh+q4wwPrz1OL8ct+xoZKcZq1abevtM4yv77NW+dKOMfGFizrFX7Tso7aVed+072r3bGrr6n5TbXy0qzZjUfo9n73qyvsWIqH/XX5Xsya9aaY5ZlUCwPD8OMGWt+PqqxVOezHm99W+rzX9ZXYyjHLtuWddX9p1pfLtdVx6u2HRiA6dOLbZoxo1huFH+Oz44ioj8di5uB/SK4s1I2PYKnx9nvCLA6glPGGWJTg4ODsXTp0n51vwapeC7fhkav+/EWVcepj9nrccu+Go3TaLx22rczVp927Y5iKbXzfjara3dbxuq7jKPRXHfTZ6M2Y21zo/7q71mrz0Anc1GNo/q6rl7frH2zuWv1Ga73U2/f6H1pNnZ925rt561ib1Xei+8cScsiYnCsdn054pE4
HdgBWCzxsMSZEpcAZ0msJ3GyxFUS10u8p7Le31bKj6+UL5C4ReIHwEsr5XMlrkjtL5TYNJUvkficxGXpqGt3iQskbpU4oR/bbGZm7elL4ongaOBuYB/gc8A84E8j+HPgCODhCHYHdgf+SmJ7iTcCLwb2AOYC8yReJzEPOBR4FfB/0jqls4C/i2BXYDmwsFL3ZASvA04H/h14H7ALMF9i83rMko6StFTS0gceeKBnc2FmZmuanmmcxRE8npbfCOwqcXB6PZMi4bwxPa5J5Rul8o2BCyN4DEBicXqeCcyK4Eep/deBb1XHTM/LgRsjuCetdwewHfCbaoARcSZwJhSn2sa7wWZm1liuxPNoZVnAByK4uNpA4k3AP0RwRq38GKCbRPBEen62sly+zrXdZmZWMxFfwBcD75X4YQRPSbwE+HUq/38SZ0ewWuJFwFPAZcAiiRNTvG8BzojgYYnfSrw2gh8DfwHPHf1MGXPmwPz5o68XLlyzvv66V6r9Dg2tfbdZL8ct+1q4EJYsGTueavtODQ11v26vlTEsWrR2Wav27ZZ30q46943mups+S+WcV9tX3+dqHzNnNh+j2fveqP9OYqq/Hh4u3pNVq+CYY0b7Lcug+FwODMAVV8BrXjP6+aj202xfq36eqnWN5n/mzNEYyrkp36trry3qqvtPtb7R2NWyJUuKOMq2c+bAypWjd7M9/XTjPnJ8dvp5V9sKYBB4P5W70CSmASdQJBABDwAHpUTyIeDI1MVq4J0R3C6xAHgXcBewErgpglMk5lJcw9kQuAN4dwS/lVgCHBvBUonhtHxAGv+5umax57yrzczsf4t272rrW+KZypx4zMw6N6G3U5uZmTXjxGNmZlk58ZiZWVZOPGZmlpUTj5mZZeXEY2ZmWTnxmJlZVk48ZmaWlROPmZll5cRjZmZZOfGYmVlWTjxmZpaVE4+ZmWXlxGNmZlk58ZiZWVZOPGZmlpUTj5mZZeXEY2ZmWTnxmJlZVk48ZmaWlROPmZll5cRjZmZZOfGYmVlWTjxmZpaVE4+ZmWXlxGNmZlk58ZiZWVaTLvFIjEgc26J+rsT+ldcHShyXJ7r2jIy0rh8eXrtt+VzWDQ+vXQcwa1bj/hu1bVVX76c6/sjI6KOs62Sb2tEqzoGB5uvMmrV222rM9fat5qVReavtbFbX6P1spdqm2b5Qff+b6WTc6n5VfS7nqN33r53t63adbvrudQz/W8eHsT8L0PnnuFuKiDwjtUliBFgdwSlN6ucDgxG8v18xDA4OxtKlS7teX4JW01qtL5cbPcOaZWX7srxRn43GblRX76c+bqkeS7fb3E77VttQj7lVvK3ajxVHq+1op4925qFZ+0bb1Ol+NFbbZvvZWGM1GrNd7a7TTd+9jqFfJnr8agzd7OPtj6FlETE4VrtJccQjsUDiFokfAC9NZUskBtPyFhIrJDYAPgUcInGtxCES8yW+mNptKfFtiavS449S+VBqf63ENRIbT9Cmmpmt86ZPdAAS84BDgVdRxHM1sKxR2wielPgklSOedARUOg34XAQ/kZgNXAy8DDgWeF8El0tsBPx+7Th0FHAUwOzZs3u0dWZmVjfhiQd4LXBhBI8BSCweR1/7AjtXTh9sko5uLgc+K3E2cEEEK+srRsSZwJlQnGobRwxmZtbCZEg8AI2+6J9m9FTgjDb7mQbsGcHjtfITJb4D7A9cIbFvBDd3F6qZmY3HZEg8lwGLJE6kiOctwBnACmAe8HPg4Er7R6DpNZpLgPcDJ0NxB1wE10rsGMFyYLnEnsBO0L/Es3Bh6/qhobXbls9l3dDQ6B0m1f5mzoRjjmk+ZqOxG9XV+6mOX7+zZaztqcbdrlZxzpnTfJ1TT127bXlnW6sxmm1DvbzVtjara/R+tlJt02xfWLJk7DuMOhm3ul9Vn8v1lixpvX6744xnnW767nUM/1vHr8bQKpZOP8fdmhR3tUksAN4F3AWsBG4CLgLOA1YDPwTeGcGAxGYU127W
B/4BeD7pmo/EFsCXKK7rTAcui+BoiS8A+wDPpL7nR/BEs3jGe1ebmdm6qN272iZF4plsnHjMzDo3pW6nNjOzdYcTj5mZZeXEY2ZmWTnxmJlZVk48ZmaWlROPmZll5cRjZmZZOfGYmVlWTjxmZpaVE4+ZmWXlxGNmZlk58ZiZWVZOPGZmlpUTj5mZZeXEY2ZmWTnxmJlZVk48ZmaWlROPmZll5cRjZmZZOfGYmVlWTjxmZpaVE4+ZmWXlxGNmZlk58ZiZWVZOPGZmlpUTj5mZZeXEY2ZmWfU18Ui8VSIkdupT/4MSn+9H32Zm1h/9PuI5DPgJcGivO5aYHsHSCD7Y6757aXgYRkaK5ZGR4nUO5Zj15V72O9VNxLZMlfkbGYGBgc7WGR4u1hkeHt3vy37K1+X+X/1cVA0MrF1e9lfG1Srm+nozZow95/X6WbPWjKdVLNXXZf20aaPbMX160V85F9V2ZR/lGGV9df6q81idv2nTin6nTy/Wr897WT5jxmgsw8MgjbafNWvN96Ycv9P3vRuKiP50LDYCbgH2ARZHsJPEMHA8cB8wF7gAWA58CHg+cFAEt0tsCZwOzE7dHRPB5RIjwDbAAPAgcCZwbAQHpPG+AAwCARwfwbclvgzsnvo/P4KFY8U+ODgYS5cu7cEsFG80QMSay/0mjY5TXe5lv1PdRGzLVJm/bvbVcp2xtPosNCqvf4aaxVSvq8bTajsardfss9Po81ztf6w5qPfbaNva6aPdue7UWHM8FknLImJwrHbTu+u+LQcB/xnBf0s8JLFbKn8l8DLgIeAO4CsR7CHxIeADwDHAacDnIviJxGzg4rQOwDxg7wgeT4ms9Ang4QheASCxaSpfEMFDEusBl0rsGsH1fdtqMzNrqZ+J5zDg1LT8jfT6O8BVEdwDIHE7cElqs5zi6AhgX2DnSlbfRGLjtLw4gscbjLcvlVN6Efw2Lb5d4iiKbd0a2BnWTjySjgKOApg9e3a92szMeqQviUdic+D1wC4SAaxHcfrru8ATlabPVl4/W4lnGrBnPcGkRPRos2HTGNX22wPHArtH8FuJRcCMRitHxJkUp+4YHBycAidDzMympn7dXHAwcFYEcyIYiGA74E5g7zbXvwR4f/lCYm4X62wKbEKRqB6WeCGwX5vjm5lZn/TrVNthwIm1sm8D7wVub2P9DwJfkrieIsbLgKPHWOeEtM4NwDMUNxdcIHENcCPF9aTL29+E3hgaGr0TZeFCWLIkz7gLFzZe7mW/U91EbMtUmb+FC2HRos7WGRqCFStG74oq9/tFi0bvnir3/+rnomrOHJg/f+1+q3G1irm+3hVXwHHHtY67vt7MmWvG0yqW+ud5aAguuwxmzy6244QTYKON4JhjGvdTnecyjrKunK/qPJXjSbDJJrB6NWy7bVFWnfcTTijK770XttqqiGXJEvjRj0a3adWqNe9sa7bN/dC3u9qmsl7e1WZmtq5o9642/3KBmZll5cRjZmZZOfGYmVlWTjxmZpaVE4+ZmWXlxGNmZlk58ZiZWVZOPGZmlpUTj5mZZeXEY2ZmWTnxmJlZVk48ZmaWlROPmZll5cRjZmZZOfGYmVlWTjxmZpaVE4+ZmWXlxGNmZlk58ZiZWVZOPGZmlpUTj5mZZeXEY2ZmWTnxmJlZVk48ZmaWlROPmZll5cRjZmZZOfGYmVlWTjxmZpaVE4+ZmWXlxGNmZlk58ZiZWVaKiImOYdKR9ABwV5erbwE82MNw+sVx9pbj7J2pECM4zkbmRMSWYzVy4ukxSUsjYnCi4xiL4+wtx9k7UyFGcJzj4VNtZmaWlROPmZll5cTTe2dOdABtcpy95Th7ZyrECI6za77GY2ZmWfmIx8zMsnLiMTOzrJx4ekjSmyXdIuk2ScdlGG87Sf8l6ReSbpT0oVS+maTvS7o1PW+ayiXp8ym+6yXtVunr8NT+VkmHV8rnSVqe1vm8JI0j3vUkXSPpovR6e0lXpjG/KWmDVP689Pq2VD9Q
6eNjqfwWSW+qlPdk7iXNknS+pJvTvO45GedT0ofTe36DpHMlzZgM8ynpq5Lul3RDpazv89dsjA5iPDm959dLulDSrG7nqJv3od04K3XHSgpJW0zkXHYtIvzowQNYD7gd2AHYALgO2LnPY24N7JaWNwb+G9gZ+AxwXCo/DjgpLe8PfA8Q8BrgylS+GXBHet40LW+a6n4O7JnW+R6w3zji/QhwDnBRen0ecGhaPh14b1r+a+D0tHwo8M20vHOa1+cB26f5Xq+Xcw98HTgyLW8AzJps8wm8CLgTeH5lHudPhvkEXgfsBtxQKev7/DUbo4MY3whMT8snVWLseI46fR86iTOVbwdcTPGf3LeYyLns+rug1x2uq4/0Bl5cef0x4GOZY/h34A3ALcDWqWxr4Ja0fAZwWKX9Lan+MOCMSvkZqWxr4OZK+RrtOoxtW+BS4PXARWlnf7DyYX9u/tKHas+0PD21U31Oy3a9mntgE4ovdNXKJ9V8UiSeX6Uvk+lpPt80WeYTGGDNL/W+z1+zMdqNsVb3VuDsRts+1hx1s193GidwPvBKYAWjiWfC5rKbh0+19U75ZVBamcqySIftrwKuBF4YEfcApOc/GCPGVuUrG5R341Tgo8Cz6fXmwKqIeLpB38/Fk+ofTu07jb9TOwAPAF9TcUrwK5JewCSbz4j4NXAK8EvgHor5Wcbkm89SjvlrNkY3/pLiCKCbGLvZr9sm6UDg1xFxXa1qss5lQ048vdPoXH2We9UlbQR8GzgmIn7XqmmDsuiivNP4DgDuj4hlbcTSqq6vcVL8K3Q34MsR8SrgUYpTDc1M1HxuCvwpxamfbYAXAPu16Hui5nMsky4uSQuAp4Gzy6IOY+lmv243tg2BBcAnG1V3GM+EfV+BE08vraQ491raFri734NKWp8i6ZwdERek4vskbZ3qtwbuHyPGVuXbNijv1B8BB0paAXyD4nTbqcAsSdMb9P1cPKl+JvBQF/F3aiWwMiKuTK/Pp0hEk20+9wXujIgHIuIp4AJgLybffJZyzF+zMdqWLrwfALwj0nmmLmJ8kM7fh3btSPGPjevSZ2lb4GpJW3URZ1/ncky9Pne3rj4o/rV8B8WOUV5sfHmfxxRwFnBqrfxk1rw4+Jm0/Cc6DuV/AAAEcUlEQVSseQHy56l8M4prG5umx53AZqnuqtS2vAC5/zhjHmb05oJvseZF2L9Oy+9jzYuw56Xll7Pmhd47KC7y9mzugR8DL03LI2kuJ9V8Aq8GbgQ2TP18HfjAZJlP1r7G0/f5azZGBzG+GbgJ2LLWruM56vR96CTOWt0KRq/xTNhcdvU563WH6/KD4s6S/6a422VBhvH2pjg8vh64Nj32pzhvfClwa3oudzQBX0rxLQcGK339JXBbery7Uj4I3JDW+SJjXAxtI+ZhRhPPDhR31tyWPqzPS+Uz0uvbUv0OlfUXpFhuoXJHWK/mHpgLLE1z+m/pwzrp5hM4Hrg59fUvFF+MEz6fwLkU152eovhX9RE55q/ZGB3EeBvFtZDyc3R6t3PUzfvQbpy1+hWMJp4JmctuH/7JHDMzy8rXeMzMLCsnHjMzy8qJx8zMsnLiMTOzrJx4zMwsKycesy5J+pykYyqvL5b0lcrrf5T0kXH0PyLp2CZ1R6VfU75Z0s8l7V2pe62KX66+VtLz0y8v3yjp5A7HH5D0593Gb9aME49Z935K8YsBSJoGbEHxHw5LewGXt9ORpPXaHTT9BNF7gL0jYifgaOCc9D/YAd4BnBIRcyPi8dR2t4j423bHSAYAJx7rOSces+5dTko8FAnnBuARSZtKeh7wMuCa9LdSTlbxt3OWSzoEQNKwir+ndA7Ff/pD0oL0N15+ALy0ybh/B/xtRDwIEBFXU/x6wfskHQm8HfikpLMlLab4LbcrJR0i6W0pjuskXZbGXC/Fd1X6Wy7vSeOcCLw2HTl9uJcTZ+u26WM3MbNGIuJuSU9Lmk2RgH5G8Qu/e1L88vD1EfGkpD+j
+EWEV1IcFV1VfukDewC7RMSdkuZR/JTKqyg+m1dT/Op03csblC8FDo+IT6TTbhdFxPkAklZHxNy0vBx4U0T8WqN/7OwI4OGI2D0lzMslXULxcynHRsQB45spszU58ZiNT3nUsxfwWYrEsxdF4vlparM3cG5EPEPxA4w/AnYHfkfxm1p3pnavBS6MiMcA0tFKu0R7vy58ObBI0nkUPy4KxR9B21XSwen1TODFwJMdjG/WNp9qMxuf8jrPKyhOtV1BccRTvb7T6s9bP1p73U7yuAmYVyvbLZW3FBFHAx+n+MXiayVtnuL7QLomNDcito+IS9qIw6wrTjxm43M5xU/pPxQRz0TEQxR/LntPilNvAJcBh6RrKVtS/Enjnzfo6zLgrelOtI2BtzQZ8zPASSlpIGkuxZ++/qexgpW0Y0RcGRGfpPgJ//LPKL83/YkNJL0k/QG8Ryj+pLpZT/lUm9n4LKe4bnNOrWyj8uI/cCFFIrqO4ojmoxFxr6Sdqh1FxNWSvknx68h3UfyJhrVExGJJLwJ+KikoEsQ7I/3VyDGcLOnFFEc5l6aYrqe4g+1qSaL4K6wHpfKnJV0HLIqIz7XRv9mY/OvUZmaWlU+1mZlZVk48ZmaWlROPmZll5cRjZmZZOfGYmVlWTjxmZpaVE4+ZmWX1PySL8UrifWTlAAAAAElFTkSuQmCC\n",
      "text/plain": [
       "<Figure size 432x288 with 1 Axes>"
      ]
     },
     "metadata": {},
     "output_type": "display_data"
    }
   ],
   "source": [
    "%matplotlib inline\n",
    "text4.dispersion_plot(['citizens','democracy','freedom','duties','America'])"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# random text"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 5,
   "metadata": {
    "scrolled": true
   },
   "outputs": [
    {
     "ename": "TypeError",
     "evalue": "generate() missing 1 required positional argument: 'words'",
     "output_type": "error",
     "traceback": [
      "\u001b[0;31m---------------------------------------------------------------------------\u001b[0m",
      "\u001b[0;31mTypeError\u001b[0m                                 Traceback (most recent call last)",
      "\u001b[0;32m<ipython-input-5-463eb7c367ab>\u001b[0m in \u001b[0;36m<module>\u001b[0;34m()\u001b[0m\n\u001b[0;32m----> 1\u001b[0;31m \u001b[0mtext3\u001b[0m\u001b[0;34m.\u001b[0m\u001b[0mgenerate\u001b[0m\u001b[0;34m(\u001b[0m\u001b[0;34m)\u001b[0m\u001b[0;34m\u001b[0m\u001b[0m\n\u001b[0m",
      "\u001b[0;31mTypeError\u001b[0m: generate() missing 1 required positional argument: 'words'"
     ]
    }
   ],
   "source": [
    "#text3.generate()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 8,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "['!',\n",
       " \"'\",\n",
       " '(',\n",
       " ')',\n",
       " ',',\n",
       " ',)',\n",
       " '.',\n",
       " '.)',\n",
       " ':',\n",
       " ';',\n",
       " ';)',\n",
       " '?',\n",
       " '?)',\n",
       " 'A',\n",
       " 'Abel',\n",
       " 'Abelmizraim',\n",
       " 'Abidah',\n",
       " 'Abide',\n",
       " 'Abimael',\n",
       " 'Abimelech',\n",
       " 'Abr',\n",
       " 'Abrah',\n",
       " 'Abraham',\n",
       " 'Abram',\n",
       " 'Accad',\n",
       " 'Achbor',\n",
       " 'Adah',\n",
       " 'Adam',\n",
       " 'Adbeel',\n",
       " 'Admah',\n",
       " 'Adullamite',\n",
       " 'After',\n",
       " 'Aholibamah',\n",
       " 'Ahuzzath',\n",
       " 'Ajah',\n",
       " 'Akan',\n",
       " 'All',\n",
       " 'Allonbachuth',\n",
       " 'Almighty',\n",
       " 'Almodad',\n",
       " 'Also',\n",
       " 'Alvah',\n",
       " 'Alvan',\n",
       " 'Am',\n",
       " 'Amal',\n",
       " 'Amalek',\n",
       " 'Amalekites',\n",
       " 'Ammon',\n",
       " 'Amorite',\n",
       " 'Amorites',\n",
       " 'Amraphel',\n",
       " 'An',\n",
       " 'Anah',\n",
       " 'Anamim',\n",
       " 'And',\n",
       " 'Aner',\n",
       " 'Angel',\n",
       " 'Appoint',\n",
       " 'Aram',\n",
       " 'Aran',\n",
       " 'Ararat',\n",
       " 'Arbah',\n",
       " 'Ard',\n",
       " 'Are',\n",
       " 'Areli',\n",
       " 'Arioch',\n",
       " 'Arise',\n",
       " 'Arkite',\n",
       " 'Arodi',\n",
       " 'Arphaxad',\n",
       " 'Art',\n",
       " 'Arvadite',\n",
       " 'As',\n",
       " 'Asenath',\n",
       " 'Ashbel',\n",
       " 'Asher',\n",
       " 'Ashkenaz',\n",
       " 'Ashteroth',\n",
       " 'Ask',\n",
       " 'Asshur',\n",
       " 'Asshurim',\n",
       " 'Assyr',\n",
       " 'Assyria',\n",
       " 'At',\n",
       " 'Atad',\n",
       " 'Avith',\n",
       " 'Baalhanan',\n",
       " 'Babel',\n",
       " 'Bashemath',\n",
       " 'Be',\n",
       " 'Because',\n",
       " 'Becher',\n",
       " 'Bedad',\n",
       " 'Beeri',\n",
       " 'Beerlahairoi',\n",
       " 'Beersheba',\n",
       " 'Behold',\n",
       " 'Bela',\n",
       " 'Belah',\n",
       " 'Benam',\n",
       " 'Benjamin',\n",
       " 'Beno',\n",
       " 'Beor',\n",
       " 'Bera',\n",
       " 'Bered',\n",
       " 'Beriah',\n",
       " 'Bethel',\n",
       " 'Bethlehem',\n",
       " 'Bethuel',\n",
       " 'Beware',\n",
       " 'Bilhah',\n",
       " 'Bilhan',\n",
       " 'Binding',\n",
       " 'Birsha',\n",
       " 'Bless',\n",
       " 'Blessed',\n",
       " 'Both',\n",
       " 'Bow',\n",
       " 'Bozrah',\n",
       " 'Bring',\n",
       " 'But',\n",
       " 'Buz',\n",
       " 'By',\n",
       " 'Cain',\n",
       " 'Cainan',\n",
       " 'Calah',\n",
       " 'Calneh',\n",
       " 'Can',\n",
       " 'Cana',\n",
       " 'Canaan',\n",
       " 'Canaanite',\n",
       " 'Canaanites',\n",
       " 'Canaanitish',\n",
       " 'Caphtorim',\n",
       " 'Carmi',\n",
       " 'Casluhim',\n",
       " 'Cast',\n",
       " 'Cause',\n",
       " 'Chaldees',\n",
       " 'Chedorlaomer',\n",
       " 'Cheran',\n",
       " 'Cherubims',\n",
       " 'Chesed',\n",
       " 'Chezib',\n",
       " 'Come',\n",
       " 'Cursed',\n",
       " 'Cush',\n",
       " 'Damascus',\n",
       " 'Dan',\n",
       " 'Day',\n",
       " 'Deborah',\n",
       " 'Dedan',\n",
       " 'Deliver',\n",
       " 'Diklah',\n",
       " 'Din',\n",
       " 'Dinah',\n",
       " 'Dinhabah',\n",
       " 'Discern',\n",
       " 'Dishan',\n",
       " 'Dishon',\n",
       " 'Do',\n",
       " 'Dodanim',\n",
       " 'Dothan',\n",
       " 'Drink',\n",
       " 'Duke',\n",
       " 'Dumah',\n",
       " 'Earth',\n",
       " 'Ebal',\n",
       " 'Eber',\n",
       " 'Edar',\n",
       " 'Eden',\n",
       " 'Edom',\n",
       " 'Edomites',\n",
       " 'Egy',\n",
       " 'Egypt',\n",
       " 'Egyptia',\n",
       " 'Egyptian',\n",
       " 'Egyptians',\n",
       " 'Ehi',\n",
       " 'Elah',\n",
       " 'Elam',\n",
       " 'Elbethel',\n",
       " 'Eldaah',\n",
       " 'EleloheIsrael',\n",
       " 'Eliezer',\n",
       " 'Eliphaz',\n",
       " 'Elishah',\n",
       " 'Ellasar',\n",
       " 'Elon',\n",
       " 'Elparan',\n",
       " 'Emins',\n",
       " 'En',\n",
       " 'Enmishpat',\n",
       " 'Eno',\n",
       " 'Enoch',\n",
       " 'Enos',\n",
       " 'Ephah',\n",
       " 'Epher',\n",
       " 'Ephra',\n",
       " 'Ephraim',\n",
       " 'Ephrath',\n",
       " 'Ephron',\n",
       " 'Er',\n",
       " 'Erech',\n",
       " 'Eri',\n",
       " 'Es',\n",
       " 'Esau',\n",
       " 'Escape',\n",
       " 'Esek',\n",
       " 'Eshban',\n",
       " 'Eshcol',\n",
       " 'Ethiopia',\n",
       " 'Euphrat',\n",
       " 'Euphrates',\n",
       " 'Eve',\n",
       " 'Even',\n",
       " 'Every',\n",
       " 'Except',\n",
       " 'Ezbon',\n",
       " 'Ezer',\n",
       " 'Fear',\n",
       " 'Feed',\n",
       " 'Fifteen',\n",
       " 'Fill',\n",
       " 'For',\n",
       " 'Forasmuch',\n",
       " 'Forgive',\n",
       " 'From',\n",
       " 'Fulfil',\n",
       " 'G',\n",
       " 'Gad',\n",
       " 'Gaham',\n",
       " 'Galeed',\n",
       " 'Gatam',\n",
       " 'Gather',\n",
       " 'Gaza',\n",
       " 'Gentiles',\n",
       " 'Gera',\n",
       " 'Gerar',\n",
       " 'Gershon',\n",
       " 'Get',\n",
       " 'Gether',\n",
       " 'Gihon',\n",
       " 'Gilead',\n",
       " 'Girgashites',\n",
       " 'Girgasite',\n",
       " 'Give',\n",
       " 'Go',\n",
       " 'God',\n",
       " 'Gomer',\n",
       " 'Gomorrah',\n",
       " 'Goshen',\n",
       " 'Guni',\n",
       " 'Hadad',\n",
       " 'Hadar',\n",
       " 'Hadoram',\n",
       " 'Hagar',\n",
       " 'Haggi',\n",
       " 'Hai',\n",
       " 'Ham',\n",
       " 'Hamathite',\n",
       " 'Hamor',\n",
       " 'Hamul',\n",
       " 'Hanoch',\n",
       " 'Happy',\n",
       " 'Haran',\n",
       " 'Hast',\n",
       " 'Haste',\n",
       " 'Have',\n",
       " 'Havilah',\n",
       " 'Hazarmaveth',\n",
       " 'Hazezontamar',\n",
       " 'Hazo',\n",
       " 'He',\n",
       " 'Hear',\n",
       " 'Heaven',\n",
       " 'Heber',\n",
       " 'Hebrew',\n",
       " 'Hebrews',\n",
       " 'Hebron',\n",
       " 'Hemam',\n",
       " 'Hemdan',\n",
       " 'Here',\n",
       " 'Hereby',\n",
       " 'Heth',\n",
       " 'Hezron',\n",
       " 'Hiddekel',\n",
       " 'Hinder',\n",
       " 'Hirah',\n",
       " 'His',\n",
       " 'Hitti',\n",
       " 'Hittite',\n",
       " 'Hittites',\n",
       " 'Hivite',\n",
       " 'Hobah',\n",
       " 'Hori',\n",
       " 'Horite',\n",
       " 'Horites',\n",
       " 'How',\n",
       " 'Hul',\n",
       " 'Huppim',\n",
       " 'Husham',\n",
       " 'Hushim',\n",
       " 'Huz',\n",
       " 'I',\n",
       " 'If',\n",
       " 'In',\n",
       " 'Irad',\n",
       " 'Iram',\n",
       " 'Is',\n",
       " 'Isa',\n",
       " 'Isaac',\n",
       " 'Iscah',\n",
       " 'Ishbak',\n",
       " 'Ishmael',\n",
       " 'Ishmeelites',\n",
       " 'Ishuah',\n",
       " 'Isra',\n",
       " 'Israel',\n",
       " 'Issachar',\n",
       " 'Isui',\n",
       " 'It',\n",
       " 'Ithran',\n",
       " 'Jaalam',\n",
       " 'Jabal',\n",
       " 'Jabbok',\n",
       " 'Jac',\n",
       " 'Jachin',\n",
       " 'Jacob',\n",
       " 'Jahleel',\n",
       " 'Jahzeel',\n",
       " 'Jamin',\n",
       " 'Japhe',\n",
       " 'Japheth',\n",
       " 'Jared',\n",
       " 'Javan',\n",
       " 'Jebusite',\n",
       " 'Jebusites',\n",
       " 'Jegarsahadutha',\n",
       " 'Jehovahjireh',\n",
       " 'Jemuel',\n",
       " 'Jerah',\n",
       " 'Jetheth',\n",
       " 'Jetur',\n",
       " 'Jeush',\n",
       " 'Jezer',\n",
       " 'Jidlaph',\n",
       " 'Jimnah',\n",
       " 'Job',\n",
       " 'Jobab',\n",
       " 'Jokshan',\n",
       " 'Joktan',\n",
       " 'Jordan',\n",
       " 'Joseph',\n",
       " 'Jubal',\n",
       " 'Judah',\n",
       " 'Judge',\n",
       " 'Judith',\n",
       " 'Kadesh',\n",
       " 'Kadmonites',\n",
       " 'Karnaim',\n",
       " 'Kedar',\n",
       " 'Kedemah',\n",
       " 'Kemuel',\n",
       " 'Kenaz',\n",
       " 'Kenites',\n",
       " 'Kenizzites',\n",
       " 'Keturah',\n",
       " 'Kiriathaim',\n",
       " 'Kirjatharba',\n",
       " 'Kittim',\n",
       " 'Know',\n",
       " 'Kohath',\n",
       " 'Kor',\n",
       " 'Korah',\n",
       " 'LO',\n",
       " 'LORD',\n",
       " 'Laban',\n",
       " 'Lahairoi',\n",
       " 'Lamech',\n",
       " 'Lasha',\n",
       " 'Lay',\n",
       " 'Leah',\n",
       " 'Lehabim',\n",
       " 'Lest',\n",
       " 'Let',\n",
       " 'Letushim',\n",
       " 'Leummim',\n",
       " 'Levi',\n",
       " 'Lie',\n",
       " 'Lift',\n",
       " 'Lo',\n",
       " 'Look',\n",
       " 'Lot',\n",
       " 'Lotan',\n",
       " 'Lud',\n",
       " 'Ludim',\n",
       " 'Luz',\n",
       " 'Maachah',\n",
       " 'Machir',\n",
       " 'Machpelah',\n",
       " 'Madai',\n",
       " 'Magdiel',\n",
       " 'Magog',\n",
       " 'Mahalaleel',\n",
       " 'Mahalath',\n",
       " 'Mahanaim',\n",
       " 'Make',\n",
       " 'Malchiel',\n",
       " 'Male',\n",
       " 'Mam',\n",
       " 'Mamre',\n",
       " 'Man',\n",
       " 'Manahath',\n",
       " 'Manass',\n",
       " 'Manasseh',\n",
       " 'Mash',\n",
       " 'Masrekah',\n",
       " 'Massa',\n",
       " 'Matred',\n",
       " 'Me',\n",
       " 'Medan',\n",
       " 'Mehetabel',\n",
       " 'Mehujael',\n",
       " 'Melchizedek',\n",
       " 'Merari',\n",
       " 'Mesha',\n",
       " 'Meshech',\n",
       " 'Mesopotamia',\n",
       " 'Methusa',\n",
       " 'Methusael',\n",
       " 'Methuselah',\n",
       " 'Mezahab',\n",
       " 'Mibsam',\n",
       " 'Mibzar',\n",
       " 'Midian',\n",
       " 'Midianites',\n",
       " 'Milcah',\n",
       " 'Mishma',\n",
       " 'Mizpah',\n",
       " 'Mizraim',\n",
       " 'Mizz',\n",
       " 'Moab',\n",
       " 'Moabites',\n",
       " 'Moreh',\n",
       " 'Moreover',\n",
       " 'Moriah',\n",
       " 'Muppim',\n",
       " 'My',\n",
       " 'Naamah',\n",
       " 'Naaman',\n",
       " 'Nahath',\n",
       " 'Nahor',\n",
       " 'Naphish',\n",
       " 'Naphtali',\n",
       " 'Naphtuhim',\n",
       " 'Nay',\n",
       " 'Nebajoth',\n",
       " 'Neither',\n",
       " 'Night',\n",
       " 'Nimrod',\n",
       " 'Nineveh',\n",
       " 'Noah',\n",
       " 'Nod',\n",
       " 'Not',\n",
       " 'Now',\n",
       " 'O',\n",
       " 'Obal',\n",
       " 'Of',\n",
       " 'Oh',\n",
       " 'Ohad',\n",
       " 'Omar',\n",
       " 'On',\n",
       " 'Onam',\n",
       " 'Onan',\n",
       " 'Only',\n",
       " 'Ophir',\n",
       " 'Our',\n",
       " 'Out',\n",
       " 'Padan',\n",
       " 'Padanaram',\n",
       " 'Paran',\n",
       " 'Pass',\n",
       " 'Pathrusim',\n",
       " 'Pau',\n",
       " 'Peace',\n",
       " 'Peleg',\n",
       " 'Peniel',\n",
       " 'Penuel',\n",
       " 'Peradventure',\n",
       " 'Perizzit',\n",
       " 'Perizzite',\n",
       " 'Perizzites',\n",
       " 'Phallu',\n",
       " 'Phara',\n",
       " 'Pharaoh',\n",
       " 'Pharez',\n",
       " 'Phichol',\n",
       " 'Philistim',\n",
       " 'Philistines',\n",
       " 'Phut',\n",
       " 'Phuvah',\n",
       " 'Pildash',\n",
       " 'Pinon',\n",
       " 'Pison',\n",
       " 'Potiphar',\n",
       " 'Potipherah',\n",
       " 'Put',\n",
       " 'Raamah',\n",
       " 'Rachel',\n",
       " 'Rameses',\n",
       " 'Rebek',\n",
       " 'Rebekah',\n",
       " 'Rehoboth',\n",
       " 'Remain',\n",
       " 'Rephaims',\n",
       " 'Resen',\n",
       " 'Return',\n",
       " 'Reu',\n",
       " 'Reub',\n",
       " 'Reuben',\n",
       " 'Reuel',\n",
       " 'Reumah',\n",
       " 'Riphath',\n",
       " 'Rosh',\n",
       " 'Sabtah',\n",
       " 'Sabtech',\n",
       " 'Said',\n",
       " 'Salah',\n",
       " 'Salem',\n",
       " 'Samlah',\n",
       " 'Sarah',\n",
       " 'Sarai',\n",
       " 'Saul',\n",
       " 'Save',\n",
       " 'Say',\n",
       " 'Se',\n",
       " 'Seba',\n",
       " 'See',\n",
       " 'Seeing',\n",
       " 'Seir',\n",
       " 'Sell',\n",
       " 'Send',\n",
       " 'Sephar',\n",
       " 'Serah',\n",
       " 'Sered',\n",
       " 'Serug',\n",
       " 'Set',\n",
       " 'Seth',\n",
       " 'Shalem',\n",
       " 'Shall',\n",
       " 'Shalt',\n",
       " 'Shammah',\n",
       " 'Shaul',\n",
       " 'Shaveh',\n",
       " 'She',\n",
       " 'Sheba',\n",
       " 'Shebah',\n",
       " 'Shechem',\n",
       " 'Shed',\n",
       " 'Shel',\n",
       " 'Shelah',\n",
       " 'Sheleph',\n",
       " 'Shem',\n",
       " 'Shemeber',\n",
       " 'Shepho',\n",
       " 'Shillem',\n",
       " 'Shiloh',\n",
       " 'Shimron',\n",
       " 'Shinab',\n",
       " 'Shinar',\n",
       " 'Shobal',\n",
       " 'Should',\n",
       " 'Shuah',\n",
       " 'Shuni',\n",
       " 'Shur',\n",
       " 'Sichem',\n",
       " 'Siddim',\n",
       " 'Sidon',\n",
       " 'Simeon',\n",
       " 'Sinite',\n",
       " 'Sitnah',\n",
       " 'Slay',\n",
       " 'So',\n",
       " 'Sod',\n",
       " 'Sodom',\n",
       " 'Sojourn',\n",
       " 'Some',\n",
       " 'Spake',\n",
       " 'Speak',\n",
       " 'Spirit',\n",
       " 'Stand',\n",
       " 'Succoth',\n",
       " 'Surely',\n",
       " 'Swear',\n",
       " 'Syrian',\n",
       " 'Take',\n",
       " 'Tamar',\n",
       " 'Tarshish',\n",
       " 'Tebah',\n",
       " 'Tell',\n",
       " 'Tema',\n",
       " 'Teman',\n",
       " 'Temani',\n",
       " 'Terah',\n",
       " 'Thahash',\n",
       " 'That',\n",
       " 'The',\n",
       " 'Then',\n",
       " 'There',\n",
       " 'Therefore',\n",
       " 'These',\n",
       " 'They',\n",
       " 'Thirty',\n",
       " 'This',\n",
       " 'Thorns',\n",
       " 'Thou',\n",
       " 'Thus',\n",
       " 'Thy',\n",
       " 'Tidal',\n",
       " 'Timna',\n",
       " 'Timnah',\n",
       " 'Timnath',\n",
       " 'Tiras',\n",
       " 'To',\n",
       " 'Togarmah',\n",
       " 'Tola',\n",
       " 'Tubal',\n",
       " 'Tubalcain',\n",
       " 'Twelve',\n",
       " 'Two',\n",
       " 'Unstable',\n",
       " 'Until',\n",
       " 'Unto',\n",
       " 'Up',\n",
       " 'Upon',\n",
       " 'Ur',\n",
       " 'Uz',\n",
       " 'Uzal',\n",
       " 'We',\n",
       " 'What',\n",
       " 'When',\n",
       " 'Whence',\n",
       " 'Where',\n",
       " 'Whereas',\n",
       " 'Wherefore',\n",
       " 'Which',\n",
       " 'While',\n",
       " 'Who',\n",
       " 'Whose',\n",
       " 'Whoso',\n",
       " 'Why',\n",
       " 'Wilt',\n",
       " 'With',\n",
       " 'Woman',\n",
       " 'Ye',\n",
       " 'Yea',\n",
       " 'Yet',\n",
       " 'Zaavan',\n",
       " 'Zaphnathpaaneah',\n",
       " 'Zar',\n",
       " 'Zarah',\n",
       " 'Zeboiim',\n",
       " 'Zeboim',\n",
       " 'Zebul',\n",
       " 'Zebulun',\n",
       " 'Zemarite',\n",
       " 'Zepho',\n",
       " 'Zerah',\n",
       " 'Zibeon',\n",
       " 'Zidon',\n",
       " 'Zillah',\n",
       " 'Zilpah',\n",
       " 'Zimran',\n",
       " 'Ziphion',\n",
       " 'Zo',\n",
       " 'Zoar',\n",
       " 'Zohar',\n",
       " 'Zuzims',\n",
       " 'a',\n",
       " 'abated',\n",
       " 'abide',\n",
       " 'able',\n",
       " 'abode',\n",
       " 'abomination',\n",
       " 'about',\n",
       " 'above',\n",
       " 'abroad',\n",
       " 'absent',\n",
       " 'abundantly',\n",
       " 'accept',\n",
       " 'accepted',\n",
       " 'according',\n",
       " 'acknowledged',\n",
       " 'activity',\n",
       " 'add',\n",
       " 'adder',\n",
       " 'afar',\n",
       " 'afflict',\n",
       " 'affliction',\n",
       " 'afraid',\n",
       " 'after',\n",
       " 'afterward',\n",
       " 'afterwards',\n",
       " 'aga',\n",
       " 'again',\n",
       " 'against',\n",
       " 'age',\n",
       " 'aileth',\n",
       " 'air',\n",
       " 'al',\n",
       " 'alive',\n",
       " 'all',\n",
       " 'almon',\n",
       " 'alo',\n",
       " 'alone',\n",
       " 'aloud',\n",
       " 'also',\n",
       " 'altar',\n",
       " 'altogether',\n",
       " 'always',\n",
       " 'am',\n",
       " 'among',\n",
       " 'amongst',\n",
       " 'an',\n",
       " 'and',\n",
       " 'angel',\n",
       " 'angels',\n",
       " 'anger',\n",
       " 'angry',\n",
       " 'anguish',\n",
       " 'anointedst',\n",
       " 'anoth',\n",
       " 'another',\n",
       " 'answer',\n",
       " 'answered',\n",
       " 'any',\n",
       " 'anything',\n",
       " 'appe',\n",
       " 'appear',\n",
       " 'appeared',\n",
       " 'appease',\n",
       " 'appoint',\n",
       " 'appointed',\n",
       " 'aprons',\n",
       " 'archer',\n",
       " 'archers',\n",
       " 'are',\n",
       " 'arise',\n",
       " 'ark',\n",
       " 'armed',\n",
       " 'arms',\n",
       " 'army',\n",
       " 'arose',\n",
       " 'arrayed',\n",
       " 'art',\n",
       " 'artificer',\n",
       " 'as',\n",
       " 'ascending',\n",
       " 'ash',\n",
       " 'ashamed',\n",
       " 'ask',\n",
       " 'asked',\n",
       " 'asketh',\n",
       " 'ass',\n",
       " 'assembly',\n",
       " 'asses',\n",
       " 'assigned',\n",
       " 'asswaged',\n",
       " 'at',\n",
       " 'attained',\n",
       " 'audience',\n",
       " 'avenged',\n",
       " 'aw',\n",
       " 'awaked',\n",
       " 'away',\n",
       " 'awoke',\n",
       " 'back',\n",
       " 'backward',\n",
       " 'bad',\n",
       " 'bade',\n",
       " 'badest',\n",
       " 'badne',\n",
       " 'bak',\n",
       " 'bake',\n",
       " 'bakemeats',\n",
       " 'baker',\n",
       " 'bakers',\n",
       " 'balm',\n",
       " 'bands',\n",
       " 'bank',\n",
       " 'bare',\n",
       " 'barr',\n",
       " 'barren',\n",
       " 'basket',\n",
       " 'baskets',\n",
       " 'battle',\n",
       " 'bdellium',\n",
       " 'be',\n",
       " 'bear',\n",
       " 'beari',\n",
       " 'bearing',\n",
       " 'beast',\n",
       " 'beasts',\n",
       " 'beautiful',\n",
       " 'became',\n",
       " 'because',\n",
       " 'become',\n",
       " 'bed',\n",
       " 'been',\n",
       " 'befall',\n",
       " 'befell',\n",
       " 'before',\n",
       " 'began',\n",
       " 'begat',\n",
       " 'beget',\n",
       " 'begettest',\n",
       " 'begin',\n",
       " 'beginning',\n",
       " 'begotten',\n",
       " 'beguiled',\n",
       " 'beheld',\n",
       " 'behind',\n",
       " 'behold',\n",
       " 'being',\n",
       " 'believed',\n",
       " 'belly',\n",
       " 'belong',\n",
       " 'beneath',\n",
       " 'bereaved',\n",
       " 'beside',\n",
       " 'besides',\n",
       " 'besought',\n",
       " 'best',\n",
       " 'betimes',\n",
       " 'better',\n",
       " 'between',\n",
       " 'betwixt',\n",
       " 'beyond',\n",
       " 'binding',\n",
       " 'bird',\n",
       " 'birds',\n",
       " 'birthday',\n",
       " 'birthright',\n",
       " 'biteth',\n",
       " 'bitter',\n",
       " 'blame',\n",
       " 'blameless',\n",
       " 'blasted',\n",
       " 'bless',\n",
       " 'blessed',\n",
       " 'blesseth',\n",
       " 'blessi',\n",
       " 'blessing',\n",
       " 'blessings',\n",
       " 'blindness',\n",
       " 'blood',\n",
       " 'blossoms',\n",
       " 'bodies',\n",
       " 'boldly',\n",
       " 'bondman',\n",
       " 'bondmen',\n",
       " 'bondwoman',\n",
       " 'bone',\n",
       " 'bones',\n",
       " 'book',\n",
       " 'booths',\n",
       " 'border',\n",
       " 'borders',\n",
       " 'born',\n",
       " 'bosom',\n",
       " 'both',\n",
       " 'bottle',\n",
       " 'bou',\n",
       " 'boug',\n",
       " 'bough',\n",
       " 'bought',\n",
       " 'bound',\n",
       " 'bow',\n",
       " 'bowed',\n",
       " 'bowels',\n",
       " 'bowing',\n",
       " 'boys',\n",
       " 'bracelets',\n",
       " 'branches',\n",
       " 'brass',\n",
       " 'bre',\n",
       " 'breach',\n",
       " 'bread',\n",
       " 'breadth',\n",
       " 'break',\n",
       " 'breaketh',\n",
       " 'breaking',\n",
       " 'breasts',\n",
       " 'breath',\n",
       " 'breathed',\n",
       " 'breed',\n",
       " 'brethren',\n",
       " 'brick',\n",
       " 'brimstone',\n",
       " 'bring',\n",
       " 'brink',\n",
       " 'broken',\n",
       " 'brook',\n",
       " 'broth',\n",
       " 'brother',\n",
       " 'brought',\n",
       " 'brown',\n",
       " 'bruise',\n",
       " 'budded',\n",
       " 'build',\n",
       " 'builded',\n",
       " 'built',\n",
       " 'bulls',\n",
       " 'bundle',\n",
       " 'bundles',\n",
       " 'burdens',\n",
       " 'buried',\n",
       " 'burn',\n",
       " 'burning',\n",
       " 'burnt',\n",
       " 'bury',\n",
       " 'buryingplace',\n",
       " 'business',\n",
       " 'but',\n",
       " 'butler',\n",
       " 'butlers',\n",
       " 'butlership',\n",
       " 'butter',\n",
       " 'buy',\n",
       " 'by',\n",
       " 'cakes',\n",
       " 'calf',\n",
       " 'call',\n",
       " 'called',\n",
       " 'came',\n",
       " 'camel',\n",
       " 'camels',\n",
       " 'camest',\n",
       " 'can',\n",
       " 'cannot',\n",
       " 'canst',\n",
       " 'captain',\n",
       " 'captive',\n",
       " 'captives',\n",
       " 'carcases',\n",
       " 'carried',\n",
       " 'carry',\n",
       " 'cast',\n",
       " 'castles',\n",
       " 'catt',\n",
       " 'cattle',\n",
       " 'caught',\n",
       " 'cause',\n",
       " 'caused',\n",
       " 'cave',\n",
       " 'cease',\n",
       " 'ceased',\n",
       " 'certain',\n",
       " 'certainly',\n",
       " 'chain',\n",
       " 'chamber',\n",
       " 'change',\n",
       " 'changed',\n",
       " 'changes',\n",
       " 'charge',\n",
       " 'charged',\n",
       " 'chariot',\n",
       " 'chariots',\n",
       " 'chesnut',\n",
       " 'chi',\n",
       " 'chief',\n",
       " 'child',\n",
       " 'childless',\n",
       " 'childr',\n",
       " 'children',\n",
       " 'chode',\n",
       " 'choice',\n",
       " 'chose',\n",
       " 'circumcis',\n",
       " 'circumcise',\n",
       " 'circumcised',\n",
       " 'citi',\n",
       " 'cities',\n",
       " 'city',\n",
       " 'clave',\n",
       " 'clean',\n",
       " 'clear',\n",
       " 'cleave',\n",
       " 'clo',\n",
       " 'closed',\n",
       " 'clothed',\n",
       " 'clothes',\n",
       " 'cloud',\n",
       " 'clusters',\n",
       " 'co',\n",
       " 'coat',\n",
       " 'coats',\n",
       " 'coffin',\n",
       " 'cold',\n",
       " ...]"
      ]
     },
     "execution_count": 8,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "sorted(set(text3))"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# 每个词平均被使用的次数"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 9,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "16.050197203298673"
      ]
     },
     "execution_count": 9,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "len(text3)/len(set(text3))"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# 词出现的次数"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 10,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "5"
      ]
     },
     "execution_count": 10,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "text3.count(\"smote\")"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.6.5"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 2
}
